Summarizing Newspaper Comments
Abstract
This work investigates summarizing the conversations that take place in the comments section of the UK newspaper The Guardian. In the comment summarization task, comments are clustered and then ranked within each cluster, and the top-ranked comments from each cluster are used to give an overview of that cluster. Topic model clustering, compared against cosine distance clustering and k-means clustering, gave the highest agreement with a human gold standard. For ranking, PageRank was preferred over TF-IDF, Mutual Information gain, and Maximal Marginal Relevance when evaluated against sets of comments summarized by a journalist for the Guardian letters page.
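The cluster-then-rank pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the clusters have already been produced by some clustering step, builds the comment graph from bag-of-words cosine similarity (an assumption, since the abstract does not say how edges are weighted), and uses hypothetical function names throughout.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercased term counts; a stand-in for real tokenization."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def pagerank_scores(comments, damping=0.85, iterations=50):
    """Score comments by PageRank over a similarity-weighted graph."""
    n = len(comments)
    if n == 0:
        return []
    vectors = [bag_of_words(c) for c in comments]
    # Edge weight is cosine similarity (assumed); no self-loops.
    weights = [[cosine(vectors[i], vectors[j]) if i != j else 0.0
                for j in range(n)] for i in range(n)]
    out_sums = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        new_scores = []
        for j in range(n):
            # Comments with no similar neighbours pass on no score,
            # so totals need not sum to 1; only the ordering is used.
            incoming = sum(scores[i] * weights[i][j] / out_sums[i]
                           for i in range(n) if out_sums[i] > 0)
            new_scores.append((1 - damping) / n + damping * incoming)
        scores = new_scores
    return scores


def top_comments(clusters, k=1):
    """Return the k highest-scoring comments from each cluster."""
    summary = []
    for cluster in clusters:
        scores = pagerank_scores(cluster)
        ranked = sorted(zip(scores, cluster), key=lambda p: p[0], reverse=True)
        summary.append([comment for _, comment in ranked[:k]])
    return summary
```

Calling `top_comments(clusters, k=2)` on a list of comment lists would return up to two representative comments per cluster. A comment that shares no vocabulary with the rest of its cluster receives only the baseline score and so ranks last.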
Similar resources
Yet Another Summarization System with Two Modules using Empirical Knowledge
We previously proposed a summarization system, GREEN, for Japanese newspaper editorials. However, GREEN is not suitable for summarizing ordinary newspaper articles, which differ from newspaper editorials. To participate in subtasks A-1 and A-2 of TSC (Text Summarization Challenge) in NTCIR-2, we developed a new summarization system from scratch which copes with both ordinary articles and ed...
Won’t somebody please think of the children? Improving Topic Model Clustering of Newspaper Comments for Summarisation
Online newspaper articles can accumulate comments at volumes that prevent close reading. Summarisation of the comments allows interaction at a higher level and can lead to an understanding of the overall discussion. Comment summarisation requires topic clustering, comment ranking and extraction. Clustering must be robust as the subsequent extraction relies on a good set of clusters. Comment dat...
Improving Topic Model Clustering of Newspaper Comments for Summarisation
Online newspaper articles can accumulate comments at volumes that prevent close reading. Summarisation of the comments allows interaction at a higher level and can lead to an understanding of the overall discussion. Comment summarisation requires topic clustering, comment ranking and extraction. Clustering must be robust as the subsequent extraction relies on a good set of clusters. Comment dat...
Coreference Resolution on Blogs and Commented News
We focus on automatic coreference resolution for blogs and news articles with user comments as part of a project on opinion mining. We aim to study the effect of the genre shift from edited structured newspaper text to unedited, unstructured blog data. We compare our coreference resolution system on three data sets: newspaper articles, mixed newspaper articles and reader comments, and blog data...
Examining Feedback Comments on Online Auctions and Designing the Summarization Method
Bidders on net auctions write feedback comments to the sellers from whom the bidders have bought the items. Other bidders read them to determine which item to bid for. In this research, we aim at supporting bidders by summarizing the feedback comments. First, we examine feedback comments on online auctions and show the result of the examination. After that, we propose a social summarization met...